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4 <110> APPLICANT: Genencor International, Inc. 

5 Dunn -Coleman, Nigel 

6 Ward, Michael 

8 <12 0> TITLE OF INVENTION: BGL6 Beta-Glucosidase and Nucleic Acids 

9 Encoding the Same 

11 <130> FILE REFERENCE: GC781-2-PCT 

C--> 13 <140> CURRENT APPLICATION NUMBER: US/10/530,556 

C--> 14 <141> CURRENT FILING DATE: 2005-04-07 

16 <150> PRIOR APPLICATION NUMBER: US 60/424,784 

17 <151> PRIOR FILING DATE: 2002-11-07 
19 <160> NUMBER OF SEQ ID NOS : 4 

21 <170> SOFTWARE: Fast SEQ for Windows Version 4.0 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 2812 

2 5 <212> TYPE: DNA 

26 <213> ORGANISM: Trichoderma reesei 

28 <400> SEQUENCE: 1 

29 gatcacaccc ctcccaccct tctcttttca aggttgtccc cttctcccac ggctttatgt 60 

30 acttcccact ctmtaattcg ctctttccat tccaagccaa gcaacatctg tgagcagctc 120 

31 atccttccca atatgggcga atggcaggag cagatgatgg gttttgacgt ggaggatgtt 180 

32 ctgtctcagc tgagccaaaa tgagaagatt gctctcttgt ccggcattga tttctggcat 240 

33 acttatccca taccaaagta caacgtccct tcagtccgcc taacggacgg tcctaacggc 300 

34 atacgaggca caaagttttt tgctggcatt cctgctgcct gcctgccatg tgggacggcc 360 

3 5 ctggcctcta cctgggataa gcagctgctg aagaaggctg ggaagctgct cggtgatgag 42 0 

36 tgcatcgcaa aaggcgccca ctgctggctg ggcccaacaa tcaatactcc ccgatctcct 480 

37 ctgggggggc gcggcttcga gtcattttcg gaagatccgt acctgtccgg catccttgct 540 

38 gcatctatga ttctcggctg tgaaagcaca ggtgtcatct ctgccgtcaa acactttgtc 600 
3 9 gccaacgacc aggagcacga gcggcgagcg gtcgactgtc tcatcaccca gcgggctctc 660 

40 cgggaggtct atctgcgacc cttccagatc gtagcccgag atgcaaggcc cggcgcattg 720 

41 atgacatcct acaacaaggt caatggcaag cacgtcgctg acagcgccga gttccttcag 780 

42 ggcattctcc ggactgagtg gaattgggat cctctcattg tcagcgactg gtacggcacc 840 

43 tacaccacta ttgatgccat caaagccggc cttgatctcg agatgccggg cgtttcacga 900 

44 tatcgcggca aatacatcga gtctgctctg caggcccgtt tgctgaagca gtccactatc 960 

45 gatgagcgcg ctcgccgcgt gctcaggttc gcccagaagg ccagccatct caaggtctcc 1020 

46 gaggtagagc aaggccgtga cttcccagag gatcgcgtcc tcaaccgtca gatctgcggc 1080 

47 agcagcattg tcctactgaa gaatgagaac tccatcttac ctctccccaa gtccgtcaag 1140 

48 aaggtcgccc ttgttggatc ccacgtgcgt ctaccggcta tctcgggagg aggcagcgcc 12 00 

49 tctcttgtcc cttactatgc catatctcta tacgatgccg tctctgaggt actagccggt 1260 

50 gccacgatca cgcacgaggt cggtgcctat gcccaccaaa tgctgcccgt catcgacgca 1320 

51 atgatcagca acgccgtaat ccacttctac aacgacccca tcgatgtcaa agacagaaag 1380 

52 ctccttggca gtgagaacgt atcgtcgaca tcgttccagc tcatggatta caacaacatc 1440 

53 ccaacgctca acaaggccat gttctggggt actctcgtgg gcgagtttat ccctaccgcc 1500 

54 acgggaattt gggaatttgg cctcagtgtc tttggcactg ccgaccttta tattgataat 1560 
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55 gagctcgtga ttgaaaatac aacacatcag acgcgtggta ccgccttttt 

56 acgacggaaa aagtcgctac caggaggatg gtggccggca gcacctacaa 

57 gagtttgggt ctgccaacac gaccaagatg gagacgaccg gtgttgtcaa 

58 ggtgccgtac acctgggtgc ctgtctcaag gtcgacccac aggagatgat 

5 9 gtcaaggccg cagccgatgc cgactacacc atcatctgca cgggactcag 

60 gagtctgagg gttttgaccg gcctcacatg gacctgcccc ctggtgtgga 

61 tcgcaagttc ttgacgccgc tcccaatgct gtagtcgtca accagtcagg 

62 acaatgagct gggctcataa agcaaaggcc attgtgcagg cttggtatgg 

63 acaggccacg gaatctccga tgtgctcttt ggcaacgtca acccgtcggg 

64 ctatcgtggc cagtcgatgt gaagcacaac ccagcatatc tcaactacgc 

65 ggacgggtct tgtatggcga ggatgtttac gttggctaca agttctacga 

66 agggaggttc tgtttccttt tgggcatggc ctgtcttacg ctaccttcaa 

67 tctaccgtga ggacggtccc cgaaaccttc cacccggacc agcccacagt 

6 8 aagatcaaga acacgagcag tgtcccgggc gcccaggtcc tgcagctata 
6 9 ccaaactcgc ctacacatcg cccggtcaag gagctgcacg gattcgaaaa 

70 gaagctggcg aggagaagga ggtacaaata cccattgacc agtacgctac 

71 gacgagattg agagcatgtg gaagagcgag aggggcattt atgatgtgct 

72 tcgagtcagg aaatctcggg caaggggaag ctgattgtgc ctgaaacgcg 

73 gggctgtaga ttcaacacgt gagcaaaagc gattgcggaa agtaccagaa 

74 agtcaaagga tgggaacttg tgtcaataga agatatgcat gatgggcatt 

75 tttggcatta tgcaaagaag caaagatgga gtgataaaaa aaaaaaaaaa 

77 <210> SEQ ID NO: 2 

78 <211> LENGTH: 838 

79 <212> TYPE: PRT 

80 <213> ORGANISM: Trichoderma reesei 
82 <400> SEQUENCE: 2 
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100 130 135 140 

101 Ser Thr Gly Val He Ser Ala Val Lys His Phe Val Ala Asn Asp Gin 

102 145 150 155 160 

103 Glu His Glu Arg Arg Ala Val Asp Cys Leu He Thr Gin Arg Ala Leu 

104 165 170 175 

105 Arg Glu Val Tyr Leu Arg Pro Phe Gin He Val Ala Arg Asp Ala Arg 



cggaaaggga 162 0 

gctgcgtctc 1680 

ctttggcggc 1740 

tgcgcgggcc 1800 

cggcgagtgg 1860 

caccatgatc 1920 

caccccagtg 1980 

tggtaacgag 2040 

gaaactctcc 2100 

cagcgttggt 2160 

caaaacggag 222 0 

gctcccagat 2280 

agccattgtc 2340 

catttcggcc 2400 

ggtgtatctt 2460 

tagcttctgg 2520 

tgtaggattc 2580 

attctggatg 2640 

aagccaaggg 2700 

tgggatggtg 2760 

aa 2812 
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206 aacaaggtca atggcaagca cgtcgctgac agcgccgagt tccttcaggg cattctccgg 660 

207 actgagtgga attgggatcc tctcattgtc agcgactggt acggcaccta caccactatt 720 

208 gatgccatca aagccggcct tgatctcgag atgccgggcg tttcacgata tcgcggcaaa 780 
2 09 tacatcgagt ctgctctgca ggcccgtttg ctgaagcagt ccactatcga tgagcgcgct 840 

210 cgccgcgtgc tcaggttcgc ccagaaggcc agccatctca aggtctccga ggtagagcaa 900 

211 ggccgtgact tcccagagga tcgcgtcctc aaccgtcaga tctgcggcag cagcattgtc 960 

212 ctactgaaga atgagaactc catcttacct ctccccaagt ccgtcaagaa ggtcgccctt 1020 

213 gttggatccc acgtgcgtct accggctatc tcgggaggag gcagcgcctc tcttgtccct 1080 

214 tactatgcca tatctctata cgatgccgtc tctgaggtac tagccggtgc cacgatcacg 1140 

215 cacgaggtcg gtgcctatgc ccaccaaatg ctgcccgtca tcgacgcaat gatcagcaac 1200 

216 gccgtaatcc acttctacaa cgaccccatc gatgtcaaag acagaaagct ccttggcagt 1260 

217 gagaacgtat cgtcgacatc gttccagctc atggattaca acaacatccc aacgctcaac 1320 

218 aaggccatgt tctggggtac tctcgtgggc gagtttatcc ctaccgccac gggaatttgg 1380 

219 gaatttggcc tcagtgtctt tggcactgcc gacctttata ttgataatga gctcgtgatt 1440 

220 gaaaatacaa cacatcagac gcgtggtacc gcctttttcg gaaagggaac gacggaaaaa 1500 

221 gtcgctacca ggaggatggt ggccggcagc acctacaagc tgcgtctcga gtttgggtct 1560 

222 gccaacacga ccaagatgga gacgaccggt gttgtcaact ttggcggcgg tgccgtacac 1620 

223 ctgggtgcct gtctcaaggt cgacccacag gagatgattg cgcgggccgt caaggccgca 1680 

224 gccgatgccg actacaccat catctgcacg ggactcagcg gcgagtggga gtctgagggt 1740 

225 tttgaccggc ctcacatgga cctgccccct ggtgtggaca ccatgatctc gcaagttctt 1800 

226 gacgccgctc ccaatgctgt agtcgtcaac cagtcaggca ccccagtgac aatgagctgg 1860 

227 gctcataaag caaaggccat tgtgcaggct tggtatggtg gtaacgagac aggccacgga 1920 

228 atctccgatg tgctctttgg caacgtcaac ccgtcgggga aactctccct atcgtggcca 1980 

229 gtcgatgtga agcacaaccc agcatatctc aactacgcca gcgttggtgg acgggtcttg 2040 
2 30 tatggcgagg atgtttacgt tggctacaag ttctacgaca aaacggagag ggaggttctg 2100 

231 tttccttttg ggcatggcct gtcttacgct accttcaagc tcccagattc taccgtgagg 2160 

232 acggtccccg aaaccttcca cccggaccag cccacagtag ccattgtcaa gatcaagaac 2220 

233 acgagcagtg tcccgggcgc ccaggtcctg cagctataca tttcggcccc aaactcgcct 2280 

234 acacatcgcc cggtcaagga gctgcacgga ttcgaaaagg tgtatcttga agctggcgag 2340 

235 gagaaggagg tacaaatacc cattgaccag tacgctacta gcttctggga cgagattgag 2400 

236 agcatgtgga agagcgagag gggcatttat gatgtgcttg taggattctc gagtcaggaa 2460 

237 atctcgggca aggggaagct gattgtgcct gaaacgcgat tctggatggg gctgtag 2517 

239 <210> SEQ ID NO: 4 

240 <211> LENGTH: 831 

241 <212> TYPE: PRT 

242 <213> ORGANISM: Trichoderma reesei 

244 <400> SEQUENCE: 4 

245 Met Met Gly Phe Asp Val Glu Asp Val Leu Ser Gin Leu Ser Gin Asn 

246 15 10 15 

247 Glu Lys lie Ala Leu Leu Ser Gly lie Asp Phe Trp His Thr Tyr Pro 

248 20 25 30 

249 lie Pro Lys Tyr Asn Val Pro Ser Val Arg Leu Thr Asp Gly Pro Asn 

250 35 40 45 

251 Gly He Arg Gly Thr Lys Phe Phe Ala Gly He Pro Ala Ala Cys Leu 

252 50 55 60 

2 53 Pro Cys Gly Thr Ala Leu Ala Ser Thr Trp Asp Lys Gin Leu Leu Lys 

254 65 70 75 80 

255 Lys Ala Gly Lys Leu Leu Gly Asp Glu Cys He Ala Lys Gly Ala His 

256 85 90 95 
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